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3' end formation of pre-mRNAs is coupled to their transcription via the C-terminal domain (CTD) of RNA 
polymerase II (Pol II). Nearly all protein-coding transcripts are matured by cleavage and polyadenylation (CPA), 
which is frequently misregulated in disease. Understanding how transcription is coordinated with CPA in human 
cells is therefore very important. We found that the CTD is heavily phosphorylated on Ser2 (Ser2p) at poly(A) (pA) 
signals coincident with recruitment of the CstF77 CPA factor. Depletion of the Ser2 kinase Cdkl2 impairs Ser2p, 
CstF77 recruitment, and CPA, strongly suggesting that the processes are linked, as they are in budding yeast. 
Importantly, we additionally show that the high Ser2p signals at the 3' end depend on pA signal function. 
Down-regulation of CPA results in the loss of a 3' Ser2p peak, whereas a new peak is formed when CPA is induced 
de novo. Finally, high Ser2p signals are generated by Pol II pausing, which is a well-known feature of pA site 
recognition. Thus, a reciprocal relationship between early steps in pA site processing and Ser2p ensures efficient 3' 
end formation. 
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Most eukaryotic pre-mRNAs are matured at their 3' end 
by cleavage and polyadenylation (CPA) (Proudfoot 2012). 
During CPA, several multiprotein complexes (CPSF, 
CstF, CFI, and CFII) assemble onto the polyj A) (pA) signal 
within the pre-mRNA, with many additional factors also 
implicated (Zhao et al. 1999; Shi et al. 2009). A pA signal 
usually consists of an AAUAAA hexamer followed by 
a U-rich or G/U-rich element, although variants of the 
hexamer motif are also functional (Tian and Graber 2012). 
Pre-mRNA cleavage occurs between these two elements 
and is performed by CPSF73 (Mandel et al. 2006). Fol- 
lowing cleavage, the upstream product is polyadenylated, 
and the 3' product is degraded (Kim et al. 2004; Gromak 
et al. 2006). The only pre-mRNA transcripts not poly- 
adenylated are those that code for replication-dependent 
histones. However, these transcripts use many of the same 
factors for cleavage of their 3' ends, including CPSF73 as 
the endonuclease (Dominski et al. 2005; Kolev and Steitz 
2005). 

CPA is closely coupled to transcription via the C-terminal 
domain (CTD) of RNA polymerase II (Pol II), which is 
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composed of repeats of the Tyr-Ser-Pro-Thr-Ser-Pro-Ser 
heptapeptide (McCracken et al. 1997). A dynamic binding 
platform for RNA biogenesis factors is created by cis/ 
trans isomerization of prolines and phosphorylation of 
the other amino acids (Buratowski 2009; Eick and Geyer 
2013). Of these modifications, Ser2 phosphorylation 
(Ser2p) is most strongly linked to 3' end formation. In 
budding yeast, the Ser2 kinase Ctkl is required for the 
cotranscriptional recruitment of several 3' end processing 
factors to Pol II (Ahn et al. 2004). In humans, Cdk9, Brd4, 
and Cdkl2 can phosphorylate CTD on Ser2, but Cdkl2 is 
the proposed ortholog of Ctkl (Peterlin and Price 2006; 
Bartkowiak et al. 2010; Devaiah et al. 2012). Similar to 
Ctkl, Cdkl2 does not generally affect transcription, al- 
though a subset of genes is regulated (Blazek et al. 2011). 
Whether Cdkl2 performs a function analogous to Ctkl in 
coordinating 3' end formation and transcription has not 
been tested. CTD phosphorylation status is also influ- 
enced by phosphatases, of which Fcpl acts on Ser2p (Cho 
et al. 2001). 

Although the importance of Ser2p for 3' end processing 
is well established in yeast, the relationship between 
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these two events is less well understood in mammals. 
However, Pol II lacking in Ser2 does not support efficient 
CPA in human cells (Gu et al. 2012). Moreover, Pol II that 
is most heavily phosphorylated on Ser2 often occurs as 
a peak at the pA site (Rahl et al. 2010; Brookes et al. 2012; 
Grosso et al. 2012; Hintermair et al. 2012). Global chro- 
matin immunoprecipitation (ChIP) as well as analysis of 
individual genes demonstrate that this correlates with 
sites of Pol II pausing and is most frequently found 
on short highly expressed genes with the canonical 
AAUAAA motif (Glover-Cutter et al. 2008; Grosso et al. 
2012). Pausing at pA sites is well characterized and 
mediated by capture of the emergent AAUAAA hexamer 
by CPSF bound to the body of Pol II (Nag et al. 2007). Pol II 
pausing can also occur over sequences downstream from 
the pA site, where it promotes transcriptional termina- 
tion and 3' end formation (Eggermont and Proudfoot 
1993; Gromak et al. 2006; West and Proudfoot 2009). 

The processes involved in recognizing pA sites are 
likely to be very relevant in the study of diseases such 
as cancer, in which there are widespread changes in pA 
signal usage (Mayr and Bartel 2009). When this is consid- 
ered with its near-ubiquitous role in mRNA biogenesis, 
it is important to understand it. A lot of research has 
focused on elucidating the function of CTD modification 
in RNA processing, but little is known about whether the 
CTD code is influenced by pre-mRNA processing events 
for their own benefit. We describe here reciprocal cou- 
pling between 3' end processing and Ser2p. This mecha- 
nism involves Pol n pausing that promotes Ser2p by Cdkl2, 
which serves to recruit CstF77 and is required for optimal 
3' end processing. 

Results 

Ser2p levels are increased beyond the MYC 

gene pA site 

We chose the human MYC gene as a model to analyze the 
coupling between 3' end formation and transcription (Fig. 
lA). It is highly expressed and was previously demon- 
strated to be a robust model to analyze Pol II association 
and processing factor recruitment (Glover-Cutter et al. 
2008). Chip was used to detect total Pol II (N20) or three 
of its well-characterized derivatives phosphorylated on 
CTD Ser5, Ser2, or Ser7 (Fig. IB). It is important to note 
that we used antibodies that are the most specific available 
to these CTD states (Chapman et al. 2007; Hintermair 
et al. 2012). All antibodies gave substantially more signal 
than an IgG control (Supplemental Fig. 1). Pol II is most 
concentrated at the promoter, with an additional pause at 
the 3' end. SerSp and Ser7p signals were also highest at the 
beginning of the gene correlative with functions at the 
promoter and in 5' capping (Komarnitsky et al. 2000; 
Schroeder et al. 2000; Schwer and Shuman 2011). Ser2p 
was low at the 5' end of the gene but showed a significant 
peak beyond the pA signal. This peak of Ser2p at or shortly 
after pA signals has been observed previously on the MYC 
gene and many other genes in yeast and mammals (Glover- 
Cutter et al. 2008; Rahl et al. 2010; Preker et al. 2011; 



Bataille et al. 2012; Brookes et al. 2012; Grosso et al. 2012; 
Hintermair et al. 2012). While correlative with a function 
in CPA, why and how Ser2p is enriched at this position is 
not established in humans. 

We also performed ChIP to monitor the recruitment 
of the CPA factors CPSF30 and CstF77 to the MYC gene 
(Fig. IC). CPSF30 was detected at all positions, which is 
consistent with the idea that CPSF may join Pol II at the 
promoter (Dantonel et al. 1997; Nag et al. 2007; Glover- 
Cutter et al. 2008). In contrast, CstF77 showed enrich- 
ment beyond the pA signal, consistent with previous 
reports that it is recruited at the end of the transcription 
cycle (Glover-Cutter et al. 2008; Davidson and West 
2013). This experiment suggests that some members of 
the CPA machinery associate early with Pol II. However, 
complete assembly of a functional CPA complex does not 
occur until after the pA signal has been transcribed and is 
coincident with the highest Ser2p signal. 

The 3' peak of Ser2p could be explained by enhanced 
phosphorylation at those positions. Alternatively, Ser2p 
might already be present at upstream regions but in a 
state that is masked from detection. These possibilities 
are important to distinguish in order to properly interpret 
the Chip data. To do so, we performed native ChIP on the 
MYC gene in the presence of high salt and SDS. We 
predicted that this would unmask any Ser2p that is bound 
by factors or obscured within CTD folding. Consistently, 
CstF77 was undetectable under these conditions (Supple- 
mental Fig. 2). However, Ser2p should remain detectable, 
as Pol II is very tightly bound to DNA, aided by its so- 
called clamp region (Gnatt et al. 2001). To test this, we 
performed the experiment using the Ser2p antibody (Fig. 
ID). Significantly, the Ser2p profile was very similar to 
that obtained by standard ChIP. To confirm the peak at 
the pA site, an extra primer pair was used spanning this 
region (pA). This experiment strongly suggests that Ser2p 
is increased beyond the pA site rather than being ob- 
scured before it. 

Cdkl2 activity is important for Ser2p 
and 3' end formation 

In budding yeast, most Ser2p is achieved by Ctkl func- 
tion and is required for the recruitment of CPA factors to 
genes (Ahn et al. 2004). Although CTD lacking Ser2 
cannot efficiently recruit the CPA factor Pcf 1 1 in humans 
(Gu et al. 2012), the kinase involved has not been in- 
vestigated. Recently, Cdkl2 was shown to promote Ser2p 
in metazoans and is the proposed ortholog of Ctkl 
(Bartkowiak et al. 2010; Blazek et al. 201 1 ). We used RNAi 
to deplete Cdkl2 from HeLa cells so that we could study 
its function in Ser2p and 3' end formation. 

Following confirmation of successful Cdkl2 depletion 
(Supplemental Fig. 3 A), we performed ChIP on control 
and Cdkl2 knockdown cells using the N20 and Ser2p and 
antibodies (Fig. 2A). Little difference in the Pol II profiles 
was evident, but we observed a reduction in Ser2p signals 
after the pA signal. This suggests that Cdkl2 plays an 
important role in establishing the Ser2p profile across 
genes. Similarly, the CstF77 enrichment beyond the pA 
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Figure 1. [A] Diagram of the human MYC gene 
with exons (black boxes) and the pA site indicated. 
Regions assayed by ChIP are underhned and la- 
beled, and a scale marker is shown. [B] ChIP exper- 
iment assaying the distribution of Pol U (N20), SerSp, 
Ser2p, and Ser7p across the MYC gene. For each 
antibody, values are normalized to the maximum 
value (see the Materials and Methods for details). 
(NTS) A negative control region. (C) ChIP experi- 
ment assaying the distribution of CPSF30 and 
CstF77 across the MYC gene. For each antibody, 
values are normalized to the maximum value. (NTS) 
A negative control region. [D] Native ChIP analyzing 
Ser2p across the MYC gene. Values are normalized 
to the maximum value. (NTS) A negative control 
region. All error bars represent standard deviation 
from at least three biological replicates. 
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site was reduced upon depletion of Cdkl2 (Fig. 2B). The 
recruitment of CPSF30, which has been proposed to 
bind to the body of Pol II in addition to the CTD, was 
unaffected by Cdkl2 depletion as measured by ChIP 
(Supplemental Fig. 3B). These experiments confirm that 
Cdkl2 phosphorylates Ser2p and suggest that this is 
important to recruit some 3' end formation factors,- 
namely, CstF77. This mechanism is analogous to that 
in budding yeast (Ahn et al. 2004). Finally, we analyzed 
the recruitment of Cdkl2 to the MYC gene by ChIP (Fig. 
2C). Cdkl2 was present over all regions, and the signal 
was specific because it was reduced following treatment 
with Cdkl2 siRNA. Cdkl2 is not enriched over the pA 
site in comparison with upstream regions, suggesting that 
the high Ser2p at this position depends on modulation of 
Cdkl2 activity. 



We next wanted to establish whether impaired Ser2p 
and CstF77 recruitment was important for overall 3' end 
processing efficiency. We isolated total RNA from control 
and Cdkl2-depleted cells and reverse-transcribed it with 
random hexamers. cDNA was then real-time PCR-am- 
plified with primers spanning the MYC and GAPDH gene 
pA site and downstream flanking regions such that only 
unprocessed RNA was detected (Fig. 2D). More of these 
unprocessed transcripts were detected following Cdkl2 
depletion, indicating that 3' end processing is more 
efficient when coupled to Ser2p by Cdkl2. To gain a greater 
insight into this defect, we analyzed the rate of Myc pA site 
processing (Fig. 2E). This was done by monitoring non- 
pA-cleaved pre-mRNA following transcription inhibition 
by Actinomycin D (Act D) using primers spanning the pA 
cleavage site. In this experiment, cleavage at the pA site is 
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Figure 2. (A) ChIP experiment assaying the level and distribution of Pol II (N20) and Ser2p across the MYC gene in control or Cdkl2- 
depleted cells. Values are normalized to the maximal value obtained in control siRNA-treated cells. [B] ChIP experiment assaying the 
level and distribution of CstF77 across the MYC gene in control or Cdkl2-depleted cells. Values are normalized to the maximal value 
obtained in control siRNA-treated cells. (C) ChIP experiment assaying the level and distribution of Cdkl2 across the MYC gene in 
control or Cdkl2-depleted cells. IgG control values are also shown. Values are normalized to the maximal value obtained in control 
siRNA-treated cells. (D) Real-time PCR analysis of RNA not yet pA-cleaved (UCPA) and readthrough (Flank) RNA from the MYC and 
GAPDH genes in control and Cdkl2-depleted samples. The diagram shows the final intron, pA site, and 3' flank. Quantitation shows 
a fold change in Cdkl2-depleted cells relative to control cells, normalized to terminal intron exon junction RNA. {*] P < 0.05. (£) Act D 
time-course analysis of non-pA-cleaved (UCPA) RNA in control or Cdkl2-depleted cells. RNA levels were normalized to those present 
at time 0. [F] 4thio-UTP nuclear run-on (NRO) analysis of Myc transcriptional termination in control and Cdkl2-depleted cells. The 
graph shows the fold change in cells depleted of Cdkl2 compared with control cells. Signals were normalized to those from the intron 
2-exon 3 junction. (*) P < 0.05. All error bars represent standard deviation from at least three biological replicates. 
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predicted to cause a progressive loss of this product. 
When compared with control cells, Cdkl2 depletion 
reduced the rate at which this species was lost, which is 
consistent with slower pA cleavage. A modest reduction 
in polyadenylated Myc RNA was also seen (Supplemental 
Fig. 4A,B). Importantly, exosome depletion did not alter 
the rate at which non-pA-cleaved Myc RNA was depleted, 
confirming that its loss is due to processing rather than 
degradation of the precursor (Supplemental Fig. 4C). 
Moreover, the rate of Myc splicing was unaffected by 
Cdkl2 depletion, arguing that 3' end formation is partic- 
ularly sensitive to the presence of the protein (Supple- 
mental Fig. 4E). 

Defects in 3' end processing reduce the efficiency of 
transcriptional termination (Whitelaw and Proudfoot 1986; 
Dye and Proudfoot 2001; Gromak et al. 2006). To test 
whether this was also a consequence of Cdkl2 depletion, 
we performed a nuclear run-on (NRO) analysis of Myc 
transcripts (Fig. 2F). NRO detects RNA only from actively 
transcribing Pol II, and signals observed are reflective of 
the number of such polymerases at a given position on 
the template. Nuclei from control and Cdkl2-depleted 
cells were incubated with 4-thio UTP to label nascent 
transcripts, which were subsequently biotinylated and 
purified before analysis by quantitative RT-PCR. We 
analyzed RNA over the pA site (UCPA) as well as over 
three downstream readthrough regions (Fl, F2, and F2A). 
Importantly, we observed significantly more readthrough 
RNA over F2A in Cdkl2-depleted cells as compared with 
control cells, confirming that full levels of Cdkl2 are 
required for efficient Pol II termination in this case. 

A functional pA site promotes high levels of Ser2p 

Although our study focuses on a few model genes, a peak 
of Ser2p at the pA site is a common and evolutionarily 
conserved feature of protein-coding genes (Rahl et al. 
2010; Bataille et al. 2012; Brookes et al. 2012; Grosso et al. 
2012; Hintermair et al. 2012). Moreover, on Caenorhabditis 
elegans operons, Ser2p peaks are found at each internal 3' 
end processing signal even though only a single pre-mRNA 
precursor is synthesized (Garrido-Lecca and Blumenthal 
2010). These data argue that an aspect of CPA is involved 
in generating these high Ser2p signals to augment 3' end 
formation. If this is the case, then mutation of a pA signal 
should disrupt the accumulation of Ser2p at the 3' end of 
a gene. 

To test this hypothesis, we used two HEK293 cell lines 
containing single-copy integrations of either the human 
P-globin gene (pWT) or a version with a mutated pA site 
(PpAm) (Davidson and West 2013). Expression of both 
genes was driven by a tetracycline-inducible CMV pro- 
moter (Fig. 3A). We performed ChIP on these p-globin 
genes to detect Pol II or its Ser2p derivative (Fig. 3B). 
Results were plotted to reflect the Ser2p levels normal- 
ized to Pol II levels. In the case of pWT, the proportion of 
Ser2p increased toward the 3' end of the p-globin gene, 
similar to the MYC gene. In the case of ppAm, Ser2p 
density was highest at the 5' end and had a generally 
flatter profile. Why the signal for Ser2p is slightly elevated 



at the 5' end for PpAm is not clear but could reflect the 
disruption in processing. However, another ChIP experi- 
ment on integrated p -glob in genes with normal and 
mutated pA sites gave a similar result, indicating that a 
pA site mutation disrupts the Ser2p pattern (Mapendano 
et al. 2010). 

We next performed the reciprocal experiment by asking 
whether de novo CPA would result in a corresponding 
peak of Ser2p. To test this hypothesis, we took advantage 
of the recently described function of Ul snRNA in in- 
hibiting premature CPA (PCPA) (Kaida et al. 2010; Berg 
et al. 2012). A prominent example of this occurs in the 
NR3C1 pre-mRNA, which is subject to PCPA upon in- 
hibition of Ul by an antisense morpholino (AMO). PCPA 
is rapidly induced following Ul inhibition, and an abun- 
dant product can be observed after 3 h (Davidson and 
West 2013). This was confirmed by 3' RACE performed 
on cells treated with control or Ul AMOs (Fig. 3C). 

To see whether this PCPA event affected CTD phos- 
phorylation, we performed ChIP analysis on NR3C1 with 
N20, SerSp, Ser2p, and Ser7p antibodies in control and Ul 
AMO-treated cells (Fig. 3D). Primers were used to analyze 
recruitment at positions between the promoter and 2 kb 
beyond the PCPA site. The overall Pol II profile was similar 
in both conditions over the regions assayed, but termina- 
tion of transcription eventually occurred at downstream 
positions (Supplemental Fig. 5). There was little change in 
either SerSp or Ser7p in Ul AMO samples compared with 
control treatment, showing that PCPA does not influence 
these modifications. Strikingly, the main difference be- 
tween control and Ul AMO treatments was a dramatic 
increase in the Ser2p signal beyond the PCPA site in Ul 
AMO samples compared with the control. Importantly, 
a U2 AMO that inhibits splicing but does not promote 
PCPA did not affect the level of Ser2p (Supplemental 
Fig. 6). Thus, activation of a pA site induces high Ser2p, 
providing more support for a function of CPA in establish- 
ing the pattern of Ser2p across genes and, in particular, in 
promoting high levels of Ser2p near functional pA sites. 

CPSF73 depletion reduces the level of Ser2p 
on transcribed genes 

To see whether trans-2ictmg factors also mediate CPA- 
dependent Ser2p, we used RNAi to deplete the pA endonu- 
clease CPSF73 from HeLa cells. After confirming successful 
protein knockdown (Fig. 4A), we tested that the depletion 
was sufficient to observe a 3' end processing defect. Total 
RNA was isolated from control and CPSF 73 -depleted cells 
and reverse-transcribed with random hexamers. Next, 
primers spanning MYC or GAPDH gene pA sites were 
used to perform real-time PGR and detect transcripts on 
which CPA had not yet occurred (Fig. 4B). CPSF73 de- 
pletion resulted in a threefold to fourfold increase in the 
level of these species, establishing that protein levels 
were sufficiently low to inhibit CPA. Consistently, Act D 
time-course analysis revealed a substantial reduction in 
the rate at which Myc transcripts were pA-cleaved (Fig. 
4C), and fewer polyadenylated Myc transcripts were pro- 
duced folloMring CPSF73 depletion (Supplemental Fig. 7A,B). 
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Figure 3. [A] Diagram of pWT and PpAm genes integrated into HEK293 cells. The tetracycline-inducible CMV promoter (arrow), exons 
(black boxes), and pA site are indicated. Regions assayed by ChIP are underlined. [B] ChIP across pWT and PpAm to detect the relative ratio 
of Ser2p as compared with the Pol II (N20) signal. Note that the signal differential between Exl and 3'B is significant (P < 0.05) for pWT but 
not for PpAm. (C) 3' RACE detection of PCPA of NR3C1 pre-mRNA in total RNA from control and Ul AMO-treated cells. The top panel 
shows the PCPA product, and the bottom panel shows product from the intronless TAF7 gene that acts as a loading control. The diagram 
shows the NR3C1 gene and a zoomed region showing primer pairs to detect PCPA within intron 2. [D] ChIP across the 5' portion of the 
NR3C1 gene to detect the distribution and level of Pol II, SerSp, Ser7p, and Ser2p in control or Ul AMO-treated cells. The diagram shows 
the 5' portion of the NR3C1 gene, and ampHcons are indicated and underlined. For each antibody, values are normalized to the maximal 
value obtained in control AMO-treated cells. AU error bars represent standard deviation from at least three biological replicates. 



Next, we tested the effect that reduced CPA has on Pol 
II loading and Ser2p. ChIP was performed on both con- 
trol and CPSF73-depleted cells using the N20 and Ser2p 
antibodies (Fig. 4D). CPSF73 RNAi caused little change in 



Pol II occupancy, as determined by the N20 signal. Fiow- 
ever, its depletion resulted in a more dramatic effect on 
Ser2p, which was reduced at all positions tested but in 
particular over the region beyond the pA site. The same 
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Figure 4. [A] Western blotting of protein extracts from cells treated with control or CPSF73 siRNAs. The top panel shows CPSF73 
protein, and the bottom panel shows Tubulin loading control. [B] Real-time PGR analysis of RNA not yet pA-cleaved (UCPA) from the 
MYC and GAPDH genes in control and CPSF73-depleted samples. Quantitation shows a fold change in CPSF73-depleted cells relative 
to control cells normalized to the terminal intron-exon junction. {*] P < 0.05. (C) Act D time-course analysis of non-pA-cleaved (UCPA) 
RNA in control or CPSF73 -depleted cells. RNA levels were normalized to those present at time 0. (D) ChIP experiment assaying the 
level and distribution of Pol II (N20) and Ser2p across the MYC gene in control or CPSF73-depleted cells. Values are normalized to the 
maximal value obtained in control siRNA-treated cells. (£) Real-time PCR analysis of histone HIE RNA 3' end processing in control, 
CPSF73-depleted, or Cdkl2-depleted samples. The primer pairs used are indicated on the accompanying diagram. Quantitation shows 
a fold change relative to control cells normalized to signal over the Histone gene body (5') (*) P < 0.05. [F] ChIP experiment assaying the 
level and distribution of Pol II (N20) and Ser2p across the histone HIE gene in control or CPSF73-depleted cells. Values are normalized 
to the maximal value obtained in control siRNA-treated cells. The diagram shows the positions of amplicons. All error bars represent 
standard deviation from at least three biological replicates. 



was true for GAPDH and ACTB genes (Supplemental 
Fig. 7C). Finally, Cdkl2 protein levels were equivalent in 
control and CPSF73-depleted cells, arguing that its indi- 
rect depletion was not responsible (Supplemental Fig. 7D). 
These data provide further evidence that efficient 3' end 
formation is important to establish high levels of Ser2p by 
implicating CPSF73 in the process. 



Histone RNA processing defects are not associated 
with changes in Ser2p level 

CPSF73 catalysis is also a crucial part of histone pre- 
mRNA processing, although these RNAs are not poly- 
adenylated (Dominski et al. 2005; Kolev and Steitz 2005). 
We were interested in whether there was also reciprocal 
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coupling between 3' end formation of histone transcripts 
and Ser2p or the effect was pA site-specific. Using histone 
HIE RNA as a model, we sought to confirm the role of 
CPSF73 in histone RNA processing in vivo and test for 
involvement of Cdkl2. We performed quantitative RT- 
PCR on total RNA from control, CPSF73-depleted, and 
Cdkl2-depleted cells using primers spanning the site of 3' 
end formation (Fig. 4E). We observed a substantial pro- 
cessing defect in CPSF 73 -depleted cells, in line with its 
anticipated role in histone RNA maturation. Impaired 
processing was also observed in Cdkl2-depleted samples, 
indicating that Ser2p may also be important. 

To investigate whether 3' end processing of histone 
RNA by CPSF73 was involved in promoting Ser2p, we 
performed ChIP on control and CPSF73 -depleted cells 
using Pol II and Ser2p antibodies (Fig. 4F). However, there 
was little difference in the level or distribution of Pol II 
or its Ser2p derivative in both conditions. Thus, 3' end 
processing of histone RNA and CPSF 73 catalysis per se 
does not promote elevated Ser2p levels in a way that is 
observable by ChlP. It should be noted that Ser2p levels 
are much lower over Histone HIE as compared with 
MYC, which may reflect the distinct mechanisms of 3' 
end formation (Supplemental Fig. 8). 

CPA does not impact on Ser2p levels 

We next wanted to determine the step in CPA that 
promotes Ser2p. For this purpose, we subdivided CPA 
into three stages: Pol II pausing and pA site recognition, 
pA site cleavage, and polyadenylation. Polyadenylation 
was tested first, as it distinguishes pA site processing 
(which we found promotes Ser2p) from histone RNA 
processing (which we found does not). Accordingly, we 
compared Pol II and Ser2p ChIP signals on the MYC gene 
in control cells or cells treated with cordycepin (CDY), 
which inhibits polyadenylation (Fig. 5A). As previously 
documented (Anamika et al. 2012), CDY caused a mild 
reduction of Pol n signal at the promoter,- however, the level 
and distribution of Ser2p was not significantly changed. 
Thus, polyadenylation does not positively impact on Ser2p. 

We next asked whether pA site cleavage promotes Ser2p. 
This is not a trivial issue to address because separating 
pA site recognition from cleavage is very difficult. How- 
ever, we reasoned that if pA cleavage were a prerequisite 
for high Ser2p levels, then non-pA-cleaved RNA would 
only be found in association with Pol II that was not 
heavily phosphorylated on Ser2. This would be in con- 
trast to the situation by ChIP, where Ser2p peaks at the 
pA site. To test this, we performed RNA immunoprecip- 
itation (RNA-IP) of Myc transcripts with the Pol II and 
Ser2p antibodies (Fig. 5B). We analyzed unspliced intron 1 
(Exl-Inl), unspliced intron 2 (Ex2-In2), non-pA-cleaved 
(UCPA), and 3' flanking region (Fl) transcripts. Similar 
amounts of each RNA were precipitated with the Pol II 
antibody, but when the Ser2p antibody was used, UCPA 
and Fl transcripts were immunoprecipitated more effi- 
ciently than those upstream, consistent with the ChIP 
data. If high Ser2p occurred only after pA site cleavage, it 
might have been expected that no Ser2p peak would be 



seen by RIP or that, if it was, UCPA transcripts would not 
be associated with it. This result therefore suggests that 
high Ser2p can occur on Pol II associated with transcripts 
not yet pA-cleaved. In support of this, direct visualization 
of transcription by other groups found that most Pol II 
present beyond pA sites is associated with RNA that was 
yet to be cleaved (Osheim et al. 1999, 2002). 

As a further assessment of the role of pA site cleavage 
in Ser2p, we used RNAi to deplete cells of CstF77 (Fig. 
5C). This factor is important for pA site cleavage, but our 
findings suggest that it is most efficiently recruited after 
Ser2 becomes highly phosphorylated. We first tested 
whether our depletion was sufficient to impair cleavage 
of the MYC and GAPDH gene pA sites using RT-PCR on 
RNA from control and CstF77-depleted cells (Fig. 5D). 
Uncleaved pA sites from both genes accumulated twofold 
to threefold following CstF77 depletion, confirming its 
function in CPA. This was accompanied by a reduced rate 
of Myc pA cleavage as measured by Act D time-course 
analysis (Fig. 5E) and a reduction in the number of 
polyadenylated Myc transcripts (Supplemental Fig. 9). 
Finally, we performed ChIP under both conditions to 
detect Pol II and Ser2p over the MYC gene (Fig. 5F). Im- 
portantly, even though CstF77 depletion impaired pA site 
cleavage, it did not affect Ser2p levels, indicating that the 
two processes can be uncoupled. These data further argue 
that Ser2p is promoted by a step prior to pA site cleavage. 

Early events in 3' end formation promote Ser2p 

We next considered the possibility that pausing over the 
pA site might be responsible, since the 3' peak of Ser2p 
that is observed genome-wide is coincident with a peak of 
paused Pol II (Grosso et al. 2012). Furthermore, there is 
little or no Pol 11 pausing on histone genes and no re- 
ciprocal coupling of histone HIE RNA processing and 
Ser2p, which is present at low levels (Fig. 4F; Supplemental 
Fig. 8; Anamika et al. 2012). 

It has been shown that binding of CPSF30 by non- 
structural protein lA (NSIA) from influenza virus abro- 
gates AAUAAA-dependent Pol II pausing (Nag et al. 2007). 
NSIA binds to CPSF30 and prevents the interaction of 
CPSF complexes with pre-mRNA (Nemeroff et al. 1998). 
We made two HEK293 cell lines containing a stably 
integrated single copy of the wild- type NSIA gene or a 
mutant version encoding NSIA that cannot bind CPSF30. 
Transcription of both genes was driven by a tetracycline- 
inducible promoter to allow rapid and coordinated ex- 
pression (Supplemental Fig. lOA). This system is advan- 
tageous over RNAi, since CPSF30 activity and 3' end 
formation are more rapidly inhibited, and the mutant 
that does not bind to CPSF30 is a robust control. 

To test the system, total RNA was isolated from wild- 
type and mutant NSIA cells induced with tetracycline. 
Following random hexamer-primed reverse transcription, 
uncleaved Myc and GAPDH pA sites were quantitated by 
real-time PGR (Fig. 6A). Twofold to threefold more of 
these species were present in cells expressing wild-type 
NSIA when compared with those expressing mutant 
NSIA, confirming that inhibition of CPSF30 impairs 
CPA. Consistently, wild- type NSIA also impaired the 
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Figure 5. {A) ChIP analysis of the level and distribution of Pol II and Ser2p in cells treated with DMSO or the polyadenylation inhibitor 
CDY. Values are normalized to the maximal value obtained in DMSO-treated cells. [B] RNA-IP experiment using Pol II (N20) and Ser2p 
antibodies detecting unspliced Myc intron 1 (Exl-Inl) and intron 2 (Ex2-In2) as well as non-pA-cleaved (UCPA) and flank (Fl) RNA. For 
each antibody, values are normalized to the maximal value obtained. (C) Western blot analysis of CstF77 {top panel) and PABPNl 
(loading control; bottom panel) in cells treated with control and CstF77 siRNAs. [D] Real-time PGR analysis of RNA not yet pA-cleaved 
(UCPA) from the MYC and GAPDH genes in control and CstF77-depleted samples. Quantitation shows a fold change in CstF77- 
depleted cells relative to control cells, normalized to the terminal intron-exon junction. (*) P < 0.05. (£) Act D time-course analysis of 
non-pA-cleaved (UCPA) RNA in control or CstF77-depleted cells. RNA levels are relative to those present at time 0. [F] ChIP 
experiment assaying the level and distribution of Pol II (N20) and Ser2p across the MYC gene in control or CstF77-depleted cells. Values 
are normalized to the maximal value obtained in control siRNA-treated cells. All error bars represent standard deviation from at least 
three biological replicates. 



recruitment of CPSF30 to the MYC gene (Supplemental 
Fig. lOB). We next tested the effects of NSIA on the level 
and distribution of Pol II and its Ser2p derivative (Fig. 6B). 
Results for Pol II were similar in both cell lines, but Ser2p 



levels were depleted at the 3' end of MYC in cells 
expressing wild- type NSIA as compared with mutant 
NS 1 A. This experiment provides further support for a role 
of CPSF and Pol II pausing in promoting high levels of 
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Figure 6. [A] Real-time PGR analysis of Myc or GAPDH RNA that is not cleaved at the pA site (UCPA) in cells expressing wild-type or 
mutant NSIA. Quantitation shows a fold change in wild- type NSIA cells relative to mutant NSIA cells normalized to the terminal 
intron-exon junction. {*) P < 0.05. [B] ChIP experiment assaying the level and distribution of Pol II (N20) and Ser2p across the MYC gene 
in wild-type (WT) or mutant (Mut) NSlA-expressing cells. Values are normalized to the maximal value obtained in mutant NSIA cells. 
Primer pairs used are underlined on the accompanying diagram. (C) RT-PCR determination of Pol II pausing at the Myc gRNA. Levels 
of RNA from upstream (US) of or downstream (DS) from the gRNA-binding site were compared and plotted. The graph shows the 
DS:US ratio in cells expressing control or Myc-specific gRNA. The diagram shows primers and the position to which the Cas9:gRNA 
complex is targeted (gray circle). {*) P < 0.05. (D) ChIP analysis of the level and distribution of Pol II (N20) and Ser2p across the MYC 
gene in cells transfected with a dCas9 expression construct in addition to a plasmid expressing control or MYC-specific gRNA. Values 
are normalized to the maximal value obtained in control gRNA transfected cells. All error bars represent standard deviation from at 
least three biological replicates. 



Ser2p at 3' ends of genes. CstF77 recruitment was also 
reduced by expression of wild- type NSIA (Supplemental 
Fig. lOB). 

Pol II pausing is sufficient to promote Sei2p 

The above data are consistent with a function for pA site 
pausing in promoting Ser2p. To validate this model, we 
sought to promote a single site-specific Pol II pause in 
a region where Ser2p was normally low and test whether 
it was increased as a consequence. We used the Strepto- 
coccus pyogenes type II CRISPR system that has recently 
proved useful for genome modification in human cells 
(Cong et al. 2013; Mali et al. 2013). Briefly, a guide RNA 
(gRNA) is used to target an endonuclease, Cas9, to a 
chosen part of the genome. If a catalytically dead Cas9 
(dCas9) is expressed, the complex remains bound to the 
DNA, where it impedes Pol II progress (Gilbert et al. 2013; 



Qi et al. 2013). To target this complex to the MYC gene, 
we made a construct expressing a gRNA directed close to 
the intron 1 exon 2 boundary of the MYC gene, where 
Ser2p is lower than at the 3' end of the gene. To test 
whether the MYC gRNA could restrict Pol II elongation, 
cells were transfected with a dCas9 expression plasmid 
and the MYC-specific or control gRNA constructs. Total 
RNA was isolated and reverse-transcribed before being 
real-time PCR-amplified with primers upstream of and 
downstream from the gRNA-binding site (Fig. 6C). We 
then calculated the ratio of downstream to upstream 
RNA. When compared with cells expressing the control 
gRNA, significantly less downstream RNA was recovered 
in cells expressing the MYC gRNA. This indicates that 
a proportion of Pol II is paused at the gRNA-binding site. 

Finally, we performed ChIP on cells expressing dCas9 
with either the control or MYC gRNA to detect Pol II or 
Ser2p (Fig. 6D). When compared with the control gRNA, 
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the MYC gRNA caused a modest accumulation of Pol II 
at the end of intron 1, where otherwise the pattern was 
similar. This is consistent with the expected obstruction 
to Pol II elongation provided by gRNA:dCas9. It should 
be noted that not all MYC loci are likely to be bound 
by gRNA:dCas9 due to incomplete targeting evident in 
transfection-based genome-editing experiments (Wang 
et al. 2013). Importantly, however, the MYC gRNA in- 
duced a more substantial peak of Ser2p over this region. 
These experiments strongly support the idea that Pol II 
pausing can promote Ser2p. 

Transcripts extending beyond the region of high 
Ser2p are inefficiently cleaved at the pA site 

We finally wanted to test whether the reciprocal coupling 
that we describe is important for efficient CPA. Accord- 
ingly, the rate of 3' end formation on transcripts from the 
region of high Ser2p was compared with that of down- 
stream RNAs produced by Pol 11 that escapes pausing. Pol 
n over this downstream region (F3) displayed lower Ser2p 
than that over upstream Fl and F2 regions (Fig. 7 A). To 
assay processing rates, we inhibited transcription using 
Act D and isolated total RNA samples at 10-min intervals 
over a 40-min period. Following reverse transcription with 
random hexamers, real-time PCR was used to quantitate 
RNA from Fl, F2, and F3 (Fig. 7B). Both Fl and F2 transcripts 
were strongly depleted after only 10 min, which indicates 
that pA cleavage takes place very quickly. In contrast, F3 
RNA remained stable throughout the time course, indi- 
cating that transcripts extending to this region were very 
inefficiently processed compared with those upstream. 

If the loss of Fl and F2 signal is a consequence of CPA, it 
should be prevented when the process is impaired. To test 
this, we repeated the Act D time course but analyzed the 
decay of Fl, F2, and F3 in cells expressing wild- type and 
mutant NSIA protein (Fig. 7C). The decay in mutant 
NSIA cells was rapid, as before,- however, it was much 
slower in cells expressing the wild-type protein, which 
inhibits CPA. This result confirms that the low levels of 
Fl and F2 transcripts observed following Act D treatment 
were due to functional 3' end processing. There was little 
difference between wild- type and mutant NSIA samples 
assayed for F3, since transcripts extending to this region 
are already poorly processed. In sum, these data strongly 
suggest that the mechanism that we describe promotes 
efficient CPA. 

Discussion 

We investigated how transcription, CTD modification, 
and CPA are linked in human cells. We propose that a 
reciprocal relationship between Ser2p and pausing during 
CPA coordinates 3' end formation of pre-mRNAs and 
transcription. We suggest that Pol II pausing, mediated by 
the pA signal and CPSF, promotes high levels of Ser2p by 
Cdkl2. This augments the recruitment of CstF to form 
a functional CPA complex to execute 3' end formation. 
This mechanism is advantageous for 3' end formation, as 
pA site cleavage is inefficient if Pol II reads through the 
region of high Ser2p. Our model is presented in Figure 7D. 



Our data indicate that the effect of CPA on Ser2p is 
mediated by Pol II pausing, which is a well-described 
consequence of pA signal transcription (Orozco et al. 2002; 
Nag et al. 2007; Grosso et al. 2012). We argue this for 
several reasons: First, the Ser2p signal increased over the 
pA signal by ChIP and RNA-IP experiments, showing that 
template-bound Pol II that is heavily phosphorylated on 
Ser2 is associated with non-pA-cleaved RNA. Second, we 
found that there is no effect of CPSF73 depletion on Ser2p 
on the histone HIE gene even though processing is 
impaired. This suggests that CPSF 73 catalysis per se and 
therefore pre-mRNA cleavage are not involved in Ser2p. 
Consistently, it has been proposed that Pol II pausing is 
transient or absent on histone genes (Anamika et al. 2012). 
Third, we found that CstF77 depletion does not impact on 
Ser2p even though pA site cleavage is impaired. Fourth, 
site-specific Pol n pausing induced by dCas9 is associated 
with a Ser2p peak even though no pA signal is present. 
Finally, direct visualization of Pol II transcription by 
electron microscopy indicates that Pol II that is associated 
with pA-cleaved RNA is rarely seen either in Drosophila 
or on human minigenes in Xenopus (Osheim et al. 1999, 
2002). This being the case, the majority of the ChIP signal 
beyond pA signals is likely to be contributed by Pol 11 that is 
associated with unprocessed pA sites. 

Our native ChIP experiment strongly suggests that 
Ser2p is added beyond the pA site rather than being 
present upstream but masked by bound factors. However, 
it should be noted that CTD repeats with Tyrlp, SerSp, or 
Ser7p can be poorly detected by this Ser2p antibody in 
vitro (Chapman et al. 2007; Hintermair et al. 2012). 
Therefore, a Ser2p peak at the 3' end may result from 
the loss of one or more of these inhibitory marks instead 
of the gain of Ser2p. This would be reflected by a sharp pA 
signal-dependent reduction in Tyrlp, SerSp, or Ser7p, 
but, to our knowledge, this has never been described in 
humans. Finally, the H5 antibody is often used to detect 
Ser2p but is less specific than the 3E10 antibody that we 
used, as it reacts strongest with CTD phosphorylated on 
Ser5 and Ser2 (Chapman et al. 2007). Even so, ChIP 
sequencing with H5 or 3E10 generates very similar 
average profiles, with peaks of signal at or beyond the 
pA signal (Rahl et al. 2010; Brookes et al. 2012; Grosso 
et al. 2012; Hintermair et al. 2012). As we propose, these 
combined data support a straightforward model in which 
Ser2p is increased after the pA site. 

We showed that a transcriptional pause can act as a 
molecular trigger for Ser2p, and a matter for future study 
will be to understand how pausing leads to enhanced 
Cdkl2 activity. The ChIP experiments in Figure 2 dem- 
onstrate that human Cdkl2 is present across the MYC 
gene. This is very similar to what was observed with 
Drosophila Cdkl2, which was detected broadly across 
several tested genes (Bartkowiak et al. 2010). Pausing 
might trigger Ser2p by localized recruitment of Cdkl2 or 
by promoting its activity. Our results favor the latter 
because, although Cdkl2 is present at the 3' end of the 
MYC gene, there is not a striking peak. A further possibil- 
ity is that Ser2p is repressed by a phosphatase that is 
inactivated in a 3' end processing-dependent manner. We 



352 GENES & DEVELOPMENT 



Transcription-coupled 3' end formation 



F1 



F2 



F3 



pA 



o 

D_ 

a5 



_c5 

0) 
DC 



Ser2p 




B 



CD 
> 
0) 



DC 



_c5 
CD 
DC 



DC 



1 

0.8 
0.6 i 
0.4 
0.2 
0 



1.2 n 
1 

0.8 
0.6 
0.4 

0.2 j 
0 




■F1 
■F2 

F3 



0 10 20 40 

Time (mins) 




■F1 mutNSIA 
F1 wt NS1A 

■F2 mut NS1A 
F2wtNS1A 
F3 mut NS1A 
FSwtNSIA 



0 10 20 40 

Time (mins) 





Ser2p density 

Figure 7. [A] Graph showing the relative ratio of Ser2p ChIP signal as compared with total Pol II (N20) over the Fl, F2, and F3 regions. 
Note that Ser2p/N20 is lowest for the F3 region (values normalized to those for the Ex2 amplicon). The diagram shows the position of 
amplicons, and the region of high Ser2p is also indicated. [B] Act D time-course analysis of RNA from the Fl, F2, or F3 regions of the 
MYC gene 3' flank. For each product, values are plotted relative to those obtained at the 0 time point following normalization to U6 
snRNA. (C) Act D time-course analysis of RNA from the Fl, F2, and F3 regions of the MYC gene 3' flank in cells expressing wild-type 
(wt) or mutant (mut) NSIA. For each product, values are plotted relative to those obtained at the 0 time point following normalization 
to U6 snRNA. [D] Model: Upstream of the pA signal, Ser2p is present at lower levels, and CPSF accompanies Pol II. Transcription of the 
pA site induces Pol II pausing (open red circle with slash) involving CPSF, and this triggers Ser2p by Cdkl2. As a result, CstF77 is 
recruited, and CPA takes place efficiently. Ser2p density is indicated by a color gradient of pink (lower) to red (higher) on the CTD. All 
error bars represent standard deviation from at least three biological replicates. 
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found this possibility unlikely given that most available 
evidence does not support inhibition of Ser2p phospha- 
tase activity at gene ends (Bataille et al. 2012; Fuda et al. 
2012). Finally, our own preliminary analysis found that 
Ser2p was not increased prematurely upon Fcpl depletion 
(data not shown). 

Our observation of reciprocal coupling between CPA- 
mediated Pol II pausing and CTD modification is an 
important conceptual advance in understanding cotran- 
scriptional pre-mRNA processing. We found that early 
events in 3 ' end formation — specifically, Pol II pausing — 
promote high levels of Ser2p. This modification is in turn 
required for the efficient recruitment of a CPA complex 
and subsequent 3' end formation. Thus, a pA signal is at 
least partly responsible for generating the CTD state 
necessary for its processing. We speculate that analogous 
coupling might be used in other cases of cotranscriptional 
RNA maturation, in particular those requiring Ser2p. The 
mechanism that we propose parallels another case of 
reciprocal coupling between pre-mRNA maturation and 
the chromatin template whereby Histone modification 
and splicing influence one another (Luco et al. 2010; de 
Almeida et al. 2011; Kim et al. 2011; Bieberstein et al. 
2012). 

Materials and methods 

Primers 

A list of primers, AMO sequences, and siRNA targets is provided 
in Supplemental Table 1 . 

Plasmids 

PWT and PpAm HEK293 cell lines were already described 
(Davidson and West 2013). For gRNA expression, the gRNA 
AAVSl-Tl plasmid (41817, Addgene) was modified by PGR to 
include sequences complementary to MYC (see Supplemental 
Table 1). dCas9 was made using the hCas9 Dl OA plasmid (41816, 
Addgene) as a template for site-directed mutagenesis whereby 
His841 was changed to Ala. To make the NSIA cell lines, 
pcDNA-NSl (a kind gift from Ervin Fodor) was amplified with 
NSIF and NSIR. The product was then inserted into a vector 
prepared by amplifying pcDNAS FRT/TO, containing human 
P-globra, with promR and PpAF primers. To make mutant NSl, 
this plasmid was amplified with NSlmF and NSlmR primers. 

Cell culture 

Cells were grown in DMEM supplemented with 10% fetal calf 
serum. Act D was used at 10 ^xg/mL. Electroporation of morphoHnos 
was performed on a confluent 10-cm-diameter dish of cells using 
10 |jlM AMO in 400 jxL of DMEM using a 4-mm gap cuvette (960 
|jlF, 280 V in a Bio-Rad gene pulser). RNA was isolated 3 h after 
electroporation. For transient transfection, 6 |jLg each of gRNA 
and dCas9 constructs were transfected using JetPrime ( Polyp lus), 
and experiments were performed 48 h later. For RNAi, 20% 
confluent 60-mm dishes were transfected with 18 |jlL of 2 |jlM 
siRNA and 5 [xL of RNAiMAX (Life Technologies) and left for 72 
h. The process was then repeated after replating of cells to 20% 
confluence. Stable cell line generation was with the Flp-IN 
system (Life Technologies) and is described elsewhere (Davidson 
and West 2013). 



Antibodies 

Antibodies used were Pol II (N20:sc899, Santa Cruz Biotechnol- 
ogy), Ser2p (3E10, ChromoTek), SerSp (3E8, ChromoTek), Ser7p 
(4E12, ChromoTek), CstF77 (C0249, Sigma), CPSF73 (C2747, 
Sigma), CPSF30 (SAB4200170, Sigma; A301-584A, Bethyl Labo- 
ratories), Tubulin (T6557, Sigma), Cdkl2 for Western (NBIOO- 
87011, Novus Biologies), PABPNl (75855, Abeam), and Cdkl2 
for Chip (57311, Abeam). 

RNA analysis 

Total RNA was isolated using Trizol (Life Technologies) and 
DNase-treated with Turbo DNase (Life Technologies). For real- 
time PCR analysis, 1 |jLg of RNA was reverse-transcribed using 
Inprom II (Promega). Parallel reactions were performed in the 
absence of reverse transcriptase. One-twentieth of the cDNA 
mix was used for real-time PCR using 5-10 pmol of forward and 
reverse primer and Brilliant III SYBR mix (Agilent Technologies) 
in a Qiagen Rotorgene machine. Differences were calculated 
using comparative quantitation. 

Chip 

In a confluent 10-cm dish, HeLa or HEK293 cells were cross- 
linked in 1 % formaldehyde for 10 min at room temperature (4 X 
60-mm dishes were used for RNAi experiments). Cross-links 
were quenched in 125 mM glycine, and cells were rinsed in PBS 
and then collected by centrifugation (500g for 5 min). Cells were 
sonicated in 400 jxL of RIPA buffer (150 mM NaCl, 1% NP40, 
0.5% DOC, 0.1% SDS, 50 mM Tris-Cl at pH 8, 5 mM EDTA at 
pH 8) for 30 sec on, 30 sec off for 12 min on high in a Bioruptor. 
Chromatin was centrifuged at 13,000g for 10 min, and superna- 
tants added to 20 |jlL of protein A/G Dynabeads (Invitrogen) 
preincubated in 500 |jlL of RIPA buffer for 2 h with antibody 
(generally 2-4 jxg per ChIP). No antibody controls were per- 
formed in parallel. Following overnight rotation at 4°C, the beads 
were washed twice in RIPA buffer, four times in wash buffer (500 
mM NaCl, 1% NP40, 1% DOC, 100 mM Tris.Cl at pH 8.5), and 
twice in RIPA buffer. Immune complexes were eluted in 0. 1 M 
NaHC03/l%SDS (15 min of rotation at room temperature). 
Cross-links were reversed for 5 h at 65°C (250 mM NaCl, 1 |jLg 
RNase A). DNA was phenol-chloroform-extracted and ethanol- 
precipitated. Generally, l/50th was used for each real-time PCR 
reaction. 

RNA-IP 

For RNA-IP, the same protocol was used as for ChIP except 
extracts were DNase-treated (30 min at room temperature) before 
immunoprecipitation as well as after elution (1 h at 37°C). Im- 
munoprecipitations were performed in the presence of RNase 
inhibitor. Half of the elution was reverse-transcribed, and the 
other half was used for a parallel minus reverse transcriptase 
control. 

Native high-salt ChIP 

A 10-cm dish of HeLa cells was scraped in PBS and then lysed in 
5 mL of HLB before underlay ering with 1 mL of HLB -i- 10% 
sucrose. Nuclei were pelleted by spinning at 500g for 5 min. 
These were resuspended in 1 mL of MNase buffer, and chromatin 
was digested for 10 min at 37°C with 1 U of micrococcal nuclease 
(Sigma). Nuclei were then lysed in high-salt RIPA buffer (500 
mM NaCl, 1% NP40, 0.5% DOC, 0.1% SDS, 50 mM Tris-Cl at 
pH 8, 5 mM EDTA at pH 8). Extracts were immunoprecipitated 
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overnight in high-salt RIPA buffer in the presence of beads to 
which antibodies had been coupled. The next day, these were 
washed four times for 5 min in high-salt RIPA buffer and then 
once in RIPA buffer. Elution and analysis were performed as for 
standard ChlP. 

Chip quantitation 

Our Chip experiments often required data obtained from differ- 
ent antibodies to be plotted on the same graph. Moreover, as 
is common for ChIP, different batches of the same antibody 
frequently varied in their effectiveness. Importantly, however, 
the pattern of recruitment was always robust. To account for 
this, values were plotted as relative percentage of input, where 
we calculated the percentage of input for each experiment before 
maximum immunoprecipitation as a relative percent input of 1 . 
Each plot displays the average and standard deviation of at least 
three experiments quantitated in this way. 

4-thio UTP NRO 

A detailed protocol for 4-thio UTP NRO is described in Davidson 
et al. ( 2012). 

Acknowledgments 

We thank Ervin Fodor (Oxford University) and Adolf o Garcia- 
Sastre (Mount Sinai School of Medicine) for the NSl A expression 
plasmid and NSl A antibody, respectively. We also thank Atlanta 
Cook, Gracjan Michlewski, and Marie-Joelle Schmidt for criti- 
cally reading the manuscript, and David Tollervey for comments 
during the revision process. This work was supported by a Well- 
come Trust Research Career Development award to S.W., and 
an EMBO long-term post-doctoral fellowship awarded to L.M. 
Grant numbers were 088499/Z/09/2 (to S.W) and Centre core 
grant from the Wellcome Trust (092076). Funding for the open 
access charge was from The Wellcome Trust. 

References 

Ahn SH, Kim M, Buratowski S. 2004. Phosphorylation of serine 
2 within the RNA polymerase II C-terminal domain couples 
transcription and 3' end processing. Mol Cell 13: 67-76. 

Anamika K, Gyenis A, Poidevin L, Poch O, Tora L. 2012. RNA 
polymerase II pausing downstream of core histone genes is 
different from genes producing polyadenylated transcripts. 
PLoS ONE 7: e38769. 

Bartkowiak B, Liu P, Phatnani HP, Fuda NJ, Cooper JJ, Price DH, 
Adelman K, Lis JT, Greenleaf AL. 2010. CDK12 is a tran- 
scription elongation-associated CTD kinase, the metazoan 
ortholog of yeast Ctkl. Genes Dev 24: 2303-2316. 

Bataille AR, Jeronimo C, Jacques PE, Laramee L, Fortin ME, 
Forest A, Bergeron M, Hanes SD, Robert F. 2012. A universal 
RNA polymerase II CTD cycle is orchestrated by complex 
interplays between kinase, phosphatase, and isomerase en- 
zymes along genes. Mol Cell 45: 158-170. 

Berg MG, Singh LN, Younis I, Liu Q, Pinto AM, Kaida D, Zhang 
Z, Cho S, Sherrill-Mix S, Wan L, et al. 2012. Ul snRNP 
determines mRNA length and regulates isoform expression. 
Cell 150: 53-64. 

Bieberstein NI, Oesterreich EC, Straube K, Neugebauer KM. 
2012. First exon length controls active chromatin signatures 
and transcription. Cell Rep 2: 62-68. 

Blazek D, Kohoutek J, Bartholomeeusen K, Johansen E, Hulinkova 
P, Luo Z, Cimermancic P, Ule J, Peterlin BM. 201 1 . The CycHn 
K/Cdkl2 complex maintains genomic stability via regulation 
of expression of DNA damage response genes. Genes Dev 25: 
2158-2172. 



Brookes E, de Santiago I, Hebenstreit D, Morris KJ, Carroll T, Xie 
SQ, Stock JK, Heidemann M, Eick D, Nozaki N, et al. 2012. 
Polycomb associates genome-wide with a specific RNA poly- 
merase II variant, and regulates metabolic genes in ESCs. Cell 
Stem Cell 10: 157-170. 

Buratowski S. 2009. Progression through the RNA polymerase II 
CTD cycle. Mol Cell 36: 541-546. 

Chapman RD, Heidemann M, Albert TK, Mailhammer R, 
Flatley A, Meisterernst M, Kremmer E, Eick D. 2007. Tran- 
scribing RNA polymerase II is phosphorylated at CTD residue 
serine-7. Science 31S: 1780-1782. 

Cho EJ, Kobor MS, Kim M, Greenblatt J, Buratowski S. 2001. 
Opposing effects of Ctkl kinase and Fcpl phosphatase at Ser 
2 of the RNA polymerase II C-terminal domain. Genes Dev 
15: 3319-3329. 

Cong L, Ran FA, Cox D, Lin S, Barretto R, Habib N, Hsu PD, Wu 
X, Jiang W, Marraffini LA, et al. 2013. Multiplex genome 
engineering using CRISPR/Cas systems. Science 339: 819- 
823. 

Dantonel JC, Murthy KG, Manley JL, Tora L. 1997. Transcrip- 
tion factor TFIID recruits factor CPSF for formation of 3 ' end 
of mRNA. Nature 389: 399-402. 

Davidson L, West S. 2013. Splicing-coupled 3' end formation 
requires a terminal splice acceptor site, but not intron 
excision. Nucleic Acids Res 41: 7101-7114. 

Davidson L, Kerr A, West S. 2012. Co-transcriptional degrada- 
tion of aberrant pre-mRNA by Xrn2. EMBO / 31: 2566-2578. 

de Almeida SF, Grosso AR, Koch F, Fenouil R, Carvalho S, 
Andrade J, Levezinho H, Gut M, Eick D, Gut I, et al. 2011. 
Splicing enhances recruitment of methyltransf erase HYPB/ 
Setd2 and methylation of histone H3 Lys36. Nat Struct Mol 
Biol 18: 977-983. 

Devaiah BN, Lewis BA, Cherman N, Hewitt MC, Albrecht BK, 
Robey PG, Ozato K, Sims RJ 3rd, Singer DS. 2012. BRD4 is 
an atypical kinase that phosphorylates serine2 of the RNA 
polymerase II carboxy-terminal domain. Proc Natl Acad Sci 
109: 6927-6932. 

Dominski Z, Yang XC, Purdy M, Wagner EJ, Marzluff WF. 2005. 
A CPSF- 73 homologue is required for cell cycle progression 
but not cell growth and interacts with a protein having 
features of CPSF-100. Mol Cell Biol 25: 1489-1500. 

Dye MJ, Proudfoot NJ. 2001. Multiple transcript cleavage pre- 
cedes polymerase release in termination by RNA polymerase 
II. Cell 105: 669-681. 

Eggermont J, Proudfoot NJ. 1993. Poly (A) signals and transcrip- 
tional pause sites combine to prevent interference between 
RNA polymerase II promoters. EMBO J 12: 2539-2548. 

Eick D, Geyer M. 2013. The RNA polymerase II carboxy- 
terminal domain code. Chem Rev 113: 8456-8490. 

Fuda NJ, Buckley MS, Wei W, Core LJ, Waters CT, Reinberg D, 
Lis JT. 2012. Fcpl dephosphorylation of the RNA polymerase 
II C-terminal domain is required for efficient transcription of 
heat shock genes. Mol Cell Biol 32: 3428-3437. 

Garrido-Lecca A, Blumenthal T. 2010. RNA polymerase II C- 
terminal domain phosphorylation patterns in Caenorhabditis 
elegans operons, polycistronic gene clusters with only one 
promoter. Mol Cell Biol 30: 3887-3893. 

Gilbert LA, Larson MH, Morsut L, Liu Z, Brar GA, Torres SE, 
Stern-Ginossar N, Brandman O, Whitehead EH, Doudna JA, 
et al. 2013. CRISPR-mediated modular RNA-guided regula- 
tion of transcription in eukaryotes. Cell 154: 442-451. 

Glover-Cutter K, Kim S, Espinosa J, Bentley DL. 2008. RNA 
polymerase II pauses and associates with pre-mRNA process- 
ing factors at both ends of genes. Nat Struct Mol Biol 15: 71-78. 

Gnatt AL, Cramer P, Fu J, Bushnell DA, Kornberg RD. 2001. 
Structural basis of transcription: An RNA polymerase II 



GENES & DEVELOPMENT 355 



Davidson et al. 



elongation complex at 3.3 A resolution. Science 292: 1876- 
1882. 

Gromak N, West S, Proudfoot NJ. 2006. Pause sites promote 
transcriptional termination of mammalian RNA polymerase 
II. Mol Cell Biol 26: 3986-3996. 

Grosso AR, de Almeida SF, Braga J, Carmo-Fonseca M. 2012. 
Dynamic transitions in RNA polymerase II density profiles 
during transcription termination. Genome Res 11: 1447- 
1456. 

Gu B, Eick D, Bensaude O. 2012. CTD serine-2 plays a critical 
role in splicing and termination factor recruitment to RNA 
polymerase II in vivo. Nucleic Acids Res 41: 1591-1603. 

Hintermair C, Heidemann M, Koch F, Descostes N, Gut M, Gut 
I, Fenouil R, Ferrier P, Flatley A, Kremmer E, et al. 2012. 
Threonine-4 of mammalian RNA polymerase II CTD is 
targeted by Polo-like kinase 3 and required for transcrip- 
tional elongation. EMBO J 31: 2784-2797. 

Kaida D, Berg MG, Younis I, Kasim M, Singh LN, Wan L, 
Dreyfuss G. 2010. Ul snRNP protects pre-mRNAs from pre- 
mature cleavage and polyadenylation. Nature 468: 664-66H. 

Kim M, Krogan NJ, Vasiljeva L, Rando OJ, Nedea E, Greenblatt 
JF, Buratowski S. 2004. The yeast Rati exonuclease promotes 
transcription termination by RNA polymerase II. Nature 
432: 517-522. 

Kim S, Kim H, Fong N, Erickson B, Bentley DL. 2011. Pre- 
mRNA splicing is a determinant of histone H3K36 methyl- 
ation. Proc Natl Acad Sci 108: 13564-13569. 

Kolev NG, Steitz JA. 2005. Symplekin and multiple other 
polyadenylation factors participate in 3 '-end maturation of 
histone mRNAs. Genes Dev 19: 2583-2592. 

Komarnitsky P, Cho EJ, Buratowski S. 2000. Different phos- 
phorylated forms of RNA polymerase II and associated 
mRNA processing factors during transcription. Genes Dev 
14: 2452-2460. 

Luco RF, Pan Q, Tominaga K, Blencowe BJ, Pereira-Smith OM, 
Misteli T. 2010. Regulation of alternative splicing by histone 
modifications. Science 317: 996-1000. 

Mah P, Yang L, Esvek KM, Aach J, Guell M, DiCarlo JE, Norville 
JE, Church GM. 2013. RNA-guided human genome engi- 
neering via Cas9. Science 339: 823-826. 

Mandel CR, Kaneko S, Zhang H, Gebauer D, Vethantham V, 
Manley JL, Tong L. 2006. Polyadenylation factor CPSF-73 is 
the pre-mRNA 3 '-end-processing endonuclease. Nature 444: 
953-956. 

Mapendano CK, Lykke- Andersen S, Kjems J, Bertrand E, Jensen 
TH. 2010. Crosstalk between mRNA 3' end processing and 
transcription initiation. Mol Cell 40: 410-422. 

Mayr C, Bartel DR 2009. Widespread shortening of 3'UTRs by 
alternative cleavage and polyadenylation activates onco- 
genes in cancer cells. Cell 138: 673-684. 

McCracken S, Fong N, Yankulov K, Ballantyne S, Pan G, 
Greenblatt J, Patterson SD, Wickens M, Bentley DL. 1997. 
The C-terminal domain of RNA polymerase II couples 
mRNA processing to transcription. Nature 385: 357-361. 

Nag A, Narsinh K, Martinson HG. 2007. The poly(A)-dependent 
transcriptional pause is mediated by CPSF acting on the body 
of the polymerase. Nat Struct Mol Biol 14: 662-669. 

Nemeroff ME, Barabino SM, Li Y, Keller W, Krug RM. 1998. 
Influenza virus NSl protein interacts with the cellular 30 
kDa subunit of CPSF and inhibits 3 'end formation of cellular 
pre-mRNAs. Mol Cell 1: 991-1000. 

Orozco IJ, Kim SJ, Martinson HG. 2002. The poly(A) signal, 
without the assistance of any downstream element, directs 
RNA polymerase II to pause in vivo and then to release 
stochastically from the template, f Biol Chem 177: 42899- 
42911. 



Osheim YN, Proudfoot NJ, Beyer AL. 1999. EM visualization of 
transcription by RNA polymerase II: Downstream termina- 
tion requires a poly( A) signal but not transcript cleavage. Mol 
Cell 3: 379-387. 

Osheim YN, Sikes ML, Beyer AL. 2002. EM visualization of Pol 
II genes in Drosophila: Most genes terminate without prior 
3' end cleavage of nascent transcripts. Chromosoma 111: 
1-12. 

Peterlin BM, Price DH. 2006. Controlling the elongation phase 
of transcription with P-TEFb. Mol Cell 13: 297-305. 

Preker P, Almvig K, Christensen MS, Valen E, Mapendano CK, 
Sandelin A, Jensen TH. 2011. PROMoter uPstream Tran- 
scripts share characteristics with mRNAs and are produced 
upstream of all three major types of mammalian promoters. 
Nucleic Acids Res 39: 7179-7193. 

Proudfoot NJ. 2012. Ending the message: Poly (A) signals then 
and now. Genes Dev 25: 1770-1782. 

Qi LS, Larson MH, Gilbert LA, Doudna JA, Weissman JS, Arkin 
AP, Lim WA. 2013. Repurposing CRISPR as an RNA-guided 
platform for sequence-specific control of gene expression. 
Cell 152: 1173-1183. 

Rahl PB, Lin CY, Sella AC, Flynn RA, McCuine S, Burge CB, 
Sharp PA, Young RA. 2010. c-Myc regulates transcriptional 
pause release. Cell 141: 432-445. 

Schroeder SC, Schwer B, Shuman S, Bentley D. 2000. Dynamic 
association of capping enzymes with transcribing RNA 
polymerase II. Genes Dev 14: 2435-2440. 

Schwer B, Shuman S. 201 1. Deciphering the RNA polymerase II 
CTD code in fission yeast. Mol Cell 43: 311-318. 

Shi Y, Di Giammartino DC, Taylor D, Sarkeshik A, Rice WJ, 
Yates JR 3rd, Frank J, Manley JL. 2009. Molecular architec- 
ture of the human pre-mRNA 3' processing complex. Mol 
Cell 33: 365-376. 

Tian B, Graber JH. 2012. Signals for pre-mRNA cleavage and 
polyadenylation. Wiley Interdiscip Rev RNA 3: 385-396. 

Wang H, Yang H, Shivalila CS, Dawlaty MM, Cheng AW, Zhang 
F, Jaenisch R. 2013. One-step generation of mice carrying 
mutations in multiple genes by CRISPR/Cas-mediated ge- 
nome engineering. Cell 153: 910-918. 

West S, Proudfoot NJ. 2009. Transcriptional termination en- 
hances protein expression in human cells. Mol Cell 33: 354- 
364. 

Whitelaw E, Proudfoot N. 1986. a-Thalassaemia caused by 
a poly(A) site mutation reveals that transcriptional termina- 
tion is linked to 3' end processing in the human al globin 
gene. EMBO J 5: 2915-2922. 

Zhao J, Hyman L, Moore C. 1999. Formation of mRNA 3' ends 
in eukaryotes: Mechanism, regulation, and interrelation- 
ships with other steps in mRNA synthesis. Microbiol Mol 
Biol Rev 63: 405-445. 



356 GENES & DEVELOPMENT 



